Goto

Collaborating Authors

 interventional distillation



ComprehensiveKnowledgeDistillation withCausalIntervention

Neural Information Processing Systems

Although theteacher haslearned rich and powerful representations, it also contains unignorable bias knowledge which is usually induced by the context prior (e.g., background) in the training data.